Folding membrane proteins by deep transfer learning

نویسندگان

  • Sheng Wang
  • Zhen Li
  • Yizhou Yu
  • Jinbo Xu
چکیده

Computational elucidation of membrane protein (MP) structures is challenging partially due to lack of sufficient solved structures for homology modeling. Here, we describe a high-throughput deep transfer learning method that first predicts MP contacts by learning from non-MPs and then predicts 3D structure models using the predicted contacts as distance restraints. Tested on 510 non-redundant MPs, our method has contact prediction accuracy at least 0.18 better than existing methods, predicts correct folds for 218 MPs, and generates 3D models with root-mean-square deviation (RMSD) less than 4 and 5 Å for 57 and 108 MPs, respectively. A rigorous blind test in the continuous automated model evaluation project shows that our method predicted high-resolution 3D models for two recent test MPs of 210 residues with RMSD ∼2 Å. We estimated that our method could predict correct folds for 1,345-1,871 reviewed human multi-pass MPs including a few hundred new folds, which shall facilitate the discovery of drugs targeting at MPs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting membrane protein contacts from non-membrane proteins by deep transfer learning

Computational prediction of membrane protein (MP) structures is very challenging partially due to lack of sufficient solved structures for homology modeling or parameter estimation of computational methods. Recently direct evolutionary coupling analysis (DCA) sheds some light on protein contact prediction and accordingly, contact-assisted folding, but DCA is effective only on some very large-si...

متن کامل

TopologyNet: Topology based deep convolutional and multi-task neural networks for biomolecular property predictions

Although deep learning approaches have had tremendous success in image, video and audio processing, computer vision, and speech recognition, their applications to three-dimensional (3D) biomolecular structural data sets have been hindered by the geometric and biological complexity. To address this problem we introduce the element-specific persistent homology (ESPH) method. ESPH represents 3D co...

متن کامل

Nascent Membrane and Secretory Proteins Differ in FRET-Detected Folding Far inside the Ribosome and in Their Exposure to Ribosomal Proteins

Fluorescence resonance energy transfer measurements reveal that a transmembrane sequence within a nascent membrane protein folds into a compact conformation near the peptidyltransferase center and remains folded as the sequence moves through a membrane bound ribosome into the translocon. This compact conformation is compatible with an alpha helix because nearly the same energy transfer efficien...

متن کامل

Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model

MOTIVATION Protein contacts contain key information for the understanding of protein structure and function and thus, contact prediction from sequence is an important problem. Recently exciting progress has been made on this problem, but the predicted contacts for proteins without many sequence homologs is still of low quality and not very useful for de novo structure prediction. METHOD This ...

متن کامل

Constraints on lateral gene transfer in promoting fimbrial usher protein diversity and function

Fimbriae are long, adhesive structures widespread throughout members of the family Enterobacteriaceae. They are multimeric extrusions, which are moved out of the bacterial cell through an integral outer membrane protein called usher. The complex folding mechanics of the usher protein were recently revealed to be catalysed by the membrane-embedded translocation and assembly module (TAM). Here, w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Cell systems

دوره 5 3  شماره 

صفحات  -

تاریخ انتشار 2017